首页> 外文OA文献 >Quickest Detection with Social Learning: Interaction of local and global decision makers
【2h】

Quickest Detection with Social Learning: Interaction of local and global decision makers

机译:社交学习中最快的检测:本地和全球的互动   决策者

摘要

We consider how local and global decision policies interact in stopping timeproblems such as quickest time change detection. Individual agents make myopiclocal decisions via social learning, that is, each agent records a privateobservation of a noisy underlying state process, selfishly optimizes its localutility and then broadcasts its local decision. Given these local decisions,how can a global decision maker achieve quickest time change detection when theunderlying state changes according to a phase-type distribution? The paperpresents four results. First, using Blackwell dominance of measures, it isshown that the optimal cost incurred in social learning based quickestdetection is always larger than that of classical quickest detection. Second,it is shown that in general the optimal decision policy for social learningbased quickest detection is characterized by multiple thresholds within thespace of Bayesian distributions. Third, using lattice programming andstochastic dominance, sufficient conditions are given for the optimal decisionpolicy to consist of a single linear hyperplane, or, more generally, athreshold curve. Estimation of the optimal linear approximation to thisthreshold curve is formulated as a simulation-based stochastic optimizationproblem. Finally, the paper shows that in multi-agent sensor management withquickest detection, where each agent views the world according to its prior,the optimal policy has a similar structure to social learning.
机译:我们考虑本地和全局决策策略如何在停止时间问题(例如最快的时间更改检测)中进行交互。个体代理通过社会学习做出近视局部决策,即每个代理记录对一个嘈杂的基础状态过程的私人观察,自私地优化其本地效用,然后广播其本地决策。给定这些本地决策,当基础状态根据阶段类型分布变化时,全球决策者如何才能最快地检测时间变化?本文介绍了四个结果。首先,利用布莱克韦尔测度的优势,表明基于社会学习的最快检测所产生的最优成本始终大于经典最快检测所产生的最优成本。其次,表明基于贝叶斯分布空间内的多个阈值通常表征了基于社会学习的最快检测的最优决策策略。第三,使用晶格规划和随机支配,给出了充分的条件,以使最佳决策策略由单个线性超平面或更一般地由阈值曲线组成。对该阈值曲线的最佳线性逼近的估算被公式化为基于仿真的随机优化问题。最后,本文表明,在具有最快检测速度的多智能体传感器管理中,每个智能体都根据其先验情况观察世界,最优策略的结构类似于社会学习。

著录项

  • 作者

    Krishnamurthy, Vikram;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号